Jan Kautz

NVIDIA

Scaling Parallel Sequence Models to Foundation-Scale Vision Encoders

May 30, 2026
Via arXiv

Grounded 3D-Aware Spatial Vision-Language Modeling

May 28, 2026
Via arXiv

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

May 27, 2026
Via arXiv

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

May 21, 2026
Via arXiv

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Apr 27, 2026
Via arXiv

SpaCeFormer: Fast Proposal-Free Open-Vocabulary 3D Instance Segmentation

Apr 22, 2026
Via arXiv

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Apr 14, 2026
Via arXiv

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Mar 19, 2026
Via arXiv

SOMA: Unifying Parametric Human Body Models

Mar 17, 2026
Via arXiv

Kimodo: Scaling Controllable Human Motion Generation

Mar 16, 2026
Via arXiv